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Bacillus anthraciSy a jjrani-positive, spore-forming bacterium, is the etiological agent of anthrax. The gene 
coding for the S-layer protein (sap) was cloned on two contiguous fragments in Escherichia coli, and the 
complete sequence of the structural gene was determined. The protein, Sap, is composed of S14 residues, 
including a classical prokaryotic 29-amino-acid signal peptide. The mature form has a calculated molecular 
mass of 83.7 kDa and a molecular mass of 94 kDa on a sodium dodecyl sulfate-polyacrylamide gel. Sap 
possesses many charged residues, is weakly acidic, and contains only 0.9% methionine and no cysteine residues. 
The N-terminal region of Sap shares sequence similarities with the Acetogenium kivui S-layer protein, the 
Bacillus brevis middle wall protein, the Thermotoga maritima Ompa protein, and the Bacillus thuringiensis 
S-layer protein. Electron microscopy observations showed that this S-layer is not observed on B. anthracis cells 
in which sap has been deleted. 



Bacillus anthracis, the causative agent of anthrax, is a gram- 
positive, spore-forming bacterium. Fully virulent bacilli are 
both capsulated and toxinogenic. The two toxins (lethal and 
edema toxins) are encoded by the virulence plasmid pXOl, 
and capsule synthesis is dependent on the presence of a second 
virulence plasmid, pX02. When the capsule is absent, the cell 
wall of B. anthracis appears layered and is composed of small 
fragments displaying a highly patterned ultrastructure (8). Holt 
and Leadbetter (9) previously described a hexagonal lattice on 
the surface of vegetative cells of B. anthracis. This cell surface 
structure most likely represents what is called an S-layer, with 
p6 symmetry and a center-to-center spacing of the particles of 
7 to 10 nm. 

S-layers, or surface arrays, have been found to be the out- 
ermost component of many archaebacteria and eubacteria. In 
most cases, single proteins compose these structures. When 
present, they represent 5 to 10% of the total cell protein, 
implying that their synthesis is energy-consuming for the bac- 
terium. The fact that S-layers are found ubiquitously suggests 
that they play vital roles in the interaction between the cell and 
its environment. It has been suggested that the S-layer is an 
important virulence factor for bacteria such as Aeromonas sal- 
monicida, Campylobacter fetus, and Rickettsia spp., protecting 
against complement killing, facilitating binding of the bacte- 
rium to host molecules, or enhancing its ability to associate 
with macrophages (see reference 21), 

Unlike most S-layers, that of B. anthracis is not the outer- 
most component of the virulent bacilli since they are encapsu- 
lated; Azotobacter spp. are another of the rare examples of 
bacteria possessing both a capsule and an S-Iayer. The S-layer 
may have an important function in linking the capsule to the 
peptidoglycan wall or controlling the exchange of molecules 
with the environment. Bacillus thuringiensis, a closely related 
entomopathogen bacillus, possesses an S-layer (12). The B. 
thuringiensis S-layer is composed of linear arrays of small par- 
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tides arranged with p2 symmeti^, the constituent being a 91.4- 
kDa protein. Another bacillus, Bacillus brevis, has an unusual 
S-layer, since the bacterium is covered by two S-layer proteins, 
the outer wall protein (molecular mass, 103.7 kDa [25]) and 
the middle wall protein (MWP) (molecular mass, 114.8 kDa 
[24]). Transcription of 'the corresponding operon is initiated 
from multiple and tandemly arranged promoters (1). 

In order to study the B. anthracis S-layer, we decided to 
analyze its composition. We hypothesized that a major cell 
protein with a molecular mass approximately equal to 94 kDa, 
which is often observed in high abundancy in culture superna- 
tant fluids of most strains of B. anthracis, was the component of 
the S-layer. Here, we describe the cloning and sequencing of 
the gene encoding the y4-kDa protein. The relationship be- 
tween this protein and the surface array component described 
by Holt and Leadbetter (9) was determined by constructing a 
sap deletion mutant and analyzing it by electron microscopy. 

MATERIALS AND METHODS 

Bacterial strains, iiicdiu, anil growth conditiuns. Escherichia coli TGI (13) was 
used as a host for plasmid constructions. E. coli .lM83(pRK24) (23) was used as 
a host for ihc mating experiments with B. anihracis KPIO (18). The B. anthracis 
plasmidlcss strain, 913 I, was obtained by eurinti RP3! of pXOJ (19). All these B. 
anihracis strains are derivatives of the Sterne strain, i.e., they are pXOl"*^ 
pX02~. Conjugative transfer from E. coli lo B. anihracis was as described by 
Pezard ct al. (18). Double crossover events were screened for as described by 
Cataldi et al. (3). E. coH was grown in L broth or on L-agar plates (14). B. 
anthracis was grown in brain heart infusion medium (Difco Uiboratorics) or in 
SPY medium 160 mM K^HPO^, 4.5 niM KH2PO4, l5mM (NH4)2S04, iO mM 
MgS04, 2.4 mM st>diuni citrate, 0.2% (wi/vot) glucose, 0.2% (wt/vol) yeast 
extract ). Spectinomycin was usett al hO |uLg/nil for IS. anthracis and £", coli, and 
ampicillui was used at 100 |Jig/ml for E. coH. 

ONA manipulation.s. Methods for the i.soiation and manipulation of recombi- 
nant DNA were as described by Manialis et al. (13). Chromosomal DNA was 
prepared as described by Fouet and Sonenshcin (6). DNA sequencing was 
carried out using Sequenase (U.S. I3iochemicals) with either the primers pro- 
vided or an oligomer hybridizing to a determined intragenic sequence, 

OUgudeuxynuek'utidc probes, DNA lilirarics, and plasniids. The two t)ligode- 
oxynucleotide probes used were SEAGI |5'-GC(TA)GGIAAAAC(TA)TTTC 
C(TA)GATGTlCC(TA)GC (TA)GAT-3'] and SHAGl [5'-'ITC(AT)GTTTT 
(ATC)CC(ATc;)ACI'rA l I fAlLjACATC-.V | . Tn LU ii i i asi lu r i OAGl, the SEAG2 
oligodeoxynucleotidc probe w:is devised compiemeiUar\' to that directly deduced 
from the amino acid .sequence, the two probes being therefore convergent. DNA 
libraries were obtained by ligaling tlie product uf complete ///ndllT and EcoKV 
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FIG. 1. SDS-PAGE analysis oC culture supcrnaiaiH.s (1 nil) Tnim dillcruni 
strains ol !i. aiuhracis. Lane 1, molecular mass markers; lane 2, strain RBA2 
(iVvo/j); lane 3, strain 9131. 



digestions of the chromosome to pUClV previously digested with ///Vidlll and 
Snia\, rcspccliveiy, and dephosphorylated. These enzymes were cht)scn accord- 
ing to the results obtained alter hyhridizing the two probes to various chromo- 
.somal digests. Three of 6,()(K) l^cnKV cU)ncs and one o! 5,()()() ////idlll clones 
were I'ound to hybridize with SEAGl and SEAG2, respectively. The existence ol 
a common 0.3-kb /xY;kV-///Vidin fragment between the SEAGl and SEAG2 
hybridizing plasmids, together with the fact that the two probes reacted with two 
independent luoRV fragments, indicated that the second I'xoMV fragment 
should overlap with part of the ////jdlll-library plasmid pEAIlO {.see Kesutt.s), a 
part which was therefore used It) screen the AcoRV DNA library. Three clones 
of 250 tested hybridized. 

The pEAI2()7 plasmid was constructed a.s follows. The internal 1-kb //f/idlil 
fragment from pEA120 (see Uesulls) was cloned in the conjugulive suicide vector 
pAT113 (23), giving rise to pEAI2()6. pEAII3 was obtained by subcioning the 
().75-kb toRV-//mdllI fragment from pEAl 10 (see Resulls) in pUClM digested 
by .S'/7wl and ////rdlll; a spectinomycin resistance ca.sselte, obtained by digesting 
pUC13i8 Spc (15, 22a) with Sac\, was inserted in pEA113, which was similarly 
digested, creating pEAI204. The £ajRI-////idlH fragment from pEAI2()4 was 
rendered blunt and in.serted in pEAI2t)6 at the Smal site. This plasmid, pEAI207, 
therefore harbors a spectinomycin ca.sselle Hanked by 1 and 0.75 kb t)f the coding 
region of the .v«/7 gene (i.e., between nucleotides 1005 and 1304 |see Results]). 

Hybridization techniques. Colony hybridization and Southern blotting were 
conducted as described by Maniatis el al. (13). DNA fragments and oligodc- 
oxynucleotides were radiolabeled by nick translation and terminal deoxynuclc- 
otidyl transferase, rcspccliveiy (13). 

Protein analysis. Sodium dodecyl suifatc-polyacrylaniide gel electrophoresis 
(SnS-FAGE) was performed as described by Uicmmli (II). Proteins contained 
in 1 ml of brain heart infusion medium culture supernatants were precipitated 
with 10% trichloroacetic acid prior to elcctrciphoresis. In order to establish the 
N-tcrminal sequence of the 'M-kDa protein, the proteins contained in the su- 
pernatant were ammonium sulfate precipitated at 70% saturation. The sequence 
was then determined from the soluhilized pellet with an Applied tiiosystems 
470A sequencer equipped with a model 120 phenytthlohydantoin analyzer. 

Electron microscopy. Bacteria were absorbed onto Formvar-earbon -coaled 
copper grids previously made hydrophilic by glow discharge. They were brielly 
stained with 1% ammonium molybdale. Excess slain was drawn olf with filter 
paper, and the preparation was allowed lo air dry. Observations and recording 
micrographs were performed on a JEOL 1010 clcctrt)n microscope operating 
under standard conditions at cither 60 or SO kV. For the thin sections, pellets ol 
cells harvested by centril'ugation were lixed for 30 min al room temperature with 
2.5%; glutaraldehyde in 0.1 M .sodium cacodyhite (pH 7.4) containing 0,1 M 
sucro.sc, 5 mM MgCl2, ^md 5 mM CaCU. Cells were then posllixcd for 1 h at room 



TABLE 1. N-teriTiina! sequences of proteolytic fragments 
obtained from the 94- kDa protein 





Apparent 


Peptide 


molecular 


number(s) 


mass 




(kDa) 


1" 


94 
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N-icrminal sequence" 



Ala pl y l y^ T hf P he Prp Asp V^l Pro Ala Asp 
Thr Glu Ala Ala Lys Vai Glu Ser Ala Lys 
Thr Val Asp Vai Asn Lys Vai Gly Lys Thr Glu 
Till Val Va l Leu Thr Ala Lyr . Ser Gly G l u 



" The underlined residues are identical in the N-lerminal sequence of the 
y4-kDa protein and that of the H. thurin^icnsis S-layer protein. 
The mature form of the protein. 
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FIG. 2, Restriction map of plasmids containing contiguous fragments of the 
xup gene. The vector for pEAllO, pEAI20, and pEAI30 is pUC19, whereas that 
for pEAI207 is pATI 13. RV. /uy/RV; 113, IlindHl. The dark boxes rcprcscnl the 
coding regions, and the halchetl one represents Uie spectinomycin resistance 
cassette. The probes used are imiicaled, and a scale bar is shown. 



tcnipcralure in the above-men I ioned Inilfcr, washed with water, and embedded 
in Spurr's resin. Thin sections were convcniionaliy stained with uranyl acetate 
and lead citrate and examined with a Philips CM 1 2 electron microscope oper- 
ating at SO kV (2t)). 

RESULTS 

Cloning the S-layer gene in E. coli. Since the S-layer protein 
is often the most abundant protein produced by a number of 
different bacterial species, we presupposed that a major cell 
protein wiiich migrates with an apparent molecular mass of 94 
kDa on an SDS-polyacrylamide gel could be a component of 
the B. anihmcis S-layer. This 94- kDa protein is produced by 
various B, anthracis strains, including plasmidless strains, and 
must therefore be chromosomal ly encoded. In the culture su- 
pernatants of B. anthrcu'is 9.131, the 94-kDa polypeptide may 
represent 90% of the protein in the medium (Fig. 1). This is 
not inconsistent with it being a component of the S-layer, since 
organisms like hrevis, B. {huringiemis, and Caulobacter cres- 
centus shed excessive S-layer materia! into the growth medium 
(2, 12, 22). Since B. anthracis and B. ihuringiensis are closely 
related, we initially compared the N-terminal sequence of the 
94-kDa protein with the known N-lerminal sequence of the B. 
thuringiensis S-layer protein (12). The N-terminal sequences of 
the two mature proteins were identical to one another in 7 of 
the first 8 residues (underlined in Table 1), The 94-kDa protein 
present in the culture supernatant of B. anthracis was directly 
subjected to limited chymotrypsin digestion. The N-terminal 
sequences of the resulting four major polypeptides are shown 
in Table 1. It is noteworthy that fragments 2 and 4 possessed 
the same sequence, indicating that the latter may have been 
derived from fragment 2 by C-lerminal proteolysis. 

From two of the polypeptide sequences, oligonucleotide 
probes were synthesized by using the codon bias towards A and 
T nucleotides found in known /i. anthracis genes. With these 
probes (SEAGl and SEAG2), two overlapping fragments were 
cloned from chromosomal DNA libraries, yielding the plas- 
mids pEAI20 and pEAMO (Fig. 2). pEAI30 (Fig. 2) was ob- 
tained by screening lhe£cr>RV DNA library with a fragment of 
pEAilO. The cloned gene was named sap (for surface array 
protein). 

Sequence analysis. The sequence oi sap contained one open 
r^nHino fn mie (ORF) of 2.442 nucleotides (Fi^. 3). The ORF 



was preceded by a Shine-Dalgarno sequence (AAGGACiU) 
with a calculated AG ttf —17.8 kcal. There was an 11-nucle- 
otide spacing between the last G of the Shine-Dalgarno se- 
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1 AAGCrro TAAACTTATTCTCATAAAAAAGAAGTTTGAGAGATAAGATGTATGTTTAAAAATCCTAACATTAATATA^^ 

Hin&llJ 

101 TTATTTAGTAAATGATAAGAAAA'l'AGGTATGGTAATTTTGAAAAATCTAACGCTAAGAAAAATGTCGAATTGCAaAAAAATAAG'n^^ 

2 0 1 TAATATCTCTATACTTAGATGTTGTAATTGGAAACCrrTTTTCTGTTATAG TTGTAAA TCCTTATAAC/vAGArrG TTATAAA TGT^ 

301 TCTGTCTGTTTAAAATATTCTGAriTTATAATTCATAl'CTTTTAAAATGGAAGTCGAAAA.'^CC^ 

401 'rTTAATGTTTTAAATTGCATAAAAAGGGTGTATGATTTCTTTATGATTCTATAATACGGGATATTCTATTATTCTCTATGTCGAA'^^^ 

/J M A K T N 

501 TATATCTACTAGTTTCCAAAAAAATAGAACA^'CTATTTAAAATATTATGGAACTAGTArrACATTCCT AAGGAGG AAATAT^ 

/6 SYKKV I ACTMTAAMVAGVVSPVAAAGKTP PDVP 

601 CTC™CAAAAAAG'rAA'rcGCTGGTACAATGACAGCAGCAA7GGTAGCAGGTGTTGTTTCTCCAG"rAGCAGCAGCAGGTAAAACA^^ 

/J9ADHWGI DSl NYLVEKGAVKGNDKGMFEPGKELTR 

701 GCTGATCACTGGGGAATTGATTCTATTAACTAC'n'AGTAGAAAAAGGCGCAGTTAAAGGTAACGACAAAGGAATGTTCGAGCCTGG/i^ 

/7J AEAATMMAQILMLPIDKDAKPSFADSQGQWYTP 

BOl GTGCAGAAGCAGCTACA.i\TGATGGCTCAAATCrrAAACTTACC.^ATCGATA.\AGATGCTAAJ\CCATCTTTCGCTGACTCTCAACGCCA;^^^ 
/J 05 F1AAVEKAGVIKG7GNGPEPNGK TDRVSMASLL 

901 ATTCArcGCAGCTGTAGAAAAAGCTCCCCTTAlT/iAAGG'rACAGGAAACGGCTTTGAGCCAAACGGAAAAATCGACCGCGT^ 

/iJ9VEA YK LDTKVNGT PA TK FKDL ET bNWG KEKAN I L 

1001 GTAG AAGCTT ACAAATTAGATACTAAAGTAAACGGTACTCCAGCAACTAAATTCAAAGATTTAGAAACATTAAACTGGGGTAAAGAAAAAGCTAACATCT 
Hindi I I 

/J 73 VELGI .^VGTGDQWEPKKTVTKAEAAQF lAKTDK 

1101 TAGTTGAATTAGGAATCTCTCrrCGTACTGGTGATCAATGGGAGCCTAAGAAAACTOTAACTAAAGCAGAAGCTGCTCAATTC^ 

/206 QFGTEAAKVESAKAVTTQKVEVKFKKAVEKLTK 

1201 GCAGTTCGGTACAGAAGCAGCAAAAGTTCAATCTGCAAAAGCTGTTACAACTCAAAAACTACAAGTTAAATTCAGCAAAGCTGTTGAAAAATTM 

/ 239 EDIKVTNKANNDKVLVKEVTLSEDKKSATVELYS 

±301 GA AGATATCA AAGTAACTAACAAAGCTAACAACGAT^.AAGTACTAGTTAAAGAGGTAACTTTATCAGAAGATAAAAAATCTGCTACAGTTGAATTATATA 
EcoRV 

/2V3 NLAAKQTYTVDVNKVGKTEVAVGS.LEAKTI EMA 

1401 GTAACTTACCACCTAAACAAAC1TACACTGTAGATGTAAACAAAGTTGGTAAAACAGAAGTAGCTGTAGGTTCTTTAGAAGCAAAAACAATCGAAATGGC 

/306 DQTVVADEPTALQFTVKDENGTEVVSPEGIEFV 

1501 TGACCAAACAGTTGTAGCTGATCAGCCAACAGCATTACAATTCACAGTTAA-^GATCAA/JiCCCTACTGAAGTTGTTTCACCAGAGGG 

/339 TPAAEKINAKGEITLAKGTSTTVKAVYKKDGK ^ V 

1601 ACGCCAGCTGCAGAAAAAATTAATGCAJiJUVGGTCAAATCACTTTAGCAAAAGGTACTTCAACTACTGTAAAAGCTGTTTATA^ 

/373 AP. SKEVKVS AEGAAVASISNWTVAEQNKADFTS 

17 0 1 TAGCTGAAAGTAAAGAAGTAAAAGTTTCTGCTGAAGGTGCTGCAGTAGCTTCAJ^TCTCTAACTGGACAGTTGCAGAACAAAATAAAGCTGACTTTAC 

/406 KDFKQNNKVYEGDNAYVQVELKDQFMAVTTGKV 

1801 ?AAAGATTTCAAACAAAACAATAAAGTTTACGAAGGCGACAACGCTTACGTTCAAGTAGAATTGAAAGATCAATTTAACGCAGTAACAACTGGAA^ 

/4J9EYESLNTEVAVVDKATGKVrVLi;AGKA?VKVTVK 

1901 GAATATGAGTCGTTAAACACAGAAGTTGCTGTAGTAGATAAAGCTACTGGTAAAGTAACTGTATTATCTGCAGGAAAAGCACCAGTA/i/uACTAAC^^ 

/473 DSKGKHLVSKTVEIEAF AOKAMKFIKLE K T N V A 

2001 AAGATTC AAAAGGTAAAGAACTTGTTTCAAAAACAGTTGAAATTGMSICITCGCTC AAAAAGC AATG AAAG/^ 

Hindlll 

/506 LSTKDVTDLKVKAPVLDQYGKEFTAPVTVKVLD 

2101 GCTTTCTACAAAAGATGTAACAGATTTAAAAGTAAAAGCTCCAGTACTAGATCAATACGGTA/vAGAG^TTACAGC'I'CCTGTAACAGTGA/iAGTAC'^^ 

/b'jy K D G K E L K R O K T. p. A K Y V N K E L V L N A A G C F- A C N Y T V 

2201 AAAGATGGTAAAGA ATTAAAAGAACAAAAATTAGAAGCTAAATATGTGAACAAAGAATTAGTTCTC AATGCAGCAGGTCAAGAAGCTGGTAATTATACAG 

/573 VLTAKSGEKEAKATLALELKAPGAFSKi-'EVRGL 

2301 TTGTATTAACTGCAAAATCTGGTCAAAAAGAAGCAAAAGCTACATTAGCTCTAGAA'ri^VWVGCl'CCAGGTGCATTCTCTAAAT^ 

/603 EKELDKYVTEENQKNAMTVSVLPVDANG LVLKC 

2401 AGAAAAAGAATTAGATAAJVTATGTTACTGAGGAAAACCAAAAGAATGCAATGACTGTTTCAGTTCTTCCTGTAGATGCA.AATGGATTAGTATTAAAAGGT 

/5:?5AEAAELKVTT'rNKRaKEVDATDA0VTVCNNSVIT 

2501 GCAGAAGCAGCTGAACTAAAAGTAACAACAACAAACAAAGAAGGTAAAGAAGTAGACGCAACTGATGCACAAGTTACTGTACAAAATAACAGTG7AATTA 

/673 VGQGAKAGETYKVTVVLDGKLl'l'THSFKVVDTA 

2601 CTGTTGGTCAAGGTGCAAAAGCTGGTGAAACTTATAAAGTAACAGTTGTACTAGATGGTAAATTA/iTCACMCTCATTCATTCA;^ 

/706 PTAKGLAVEFTSTSLKEVAPHADLKA ALL NILS 

'^701 ACCAACTGCTAAAGGATTAGGAGTAGAATTTACAAGCACATCTCTTAAAGAAGTAGCTCCAAATGCTGATTTAAAAGCTGCACTTTTA-^^^ 

/739 VDGVPATTA KATVSNVEFVSADTNVVA ENGTVGA 

2801 GTTGATGGTGTACCTGCGACTACAGCAAAAGCAACAGTTTCTAATGTAGAATTTGTTTCTGCTGACACAAA'1'^ 

/773 KGATSIYVKNLT vvkdgkeokve f u k a V Q V a V s 

2901 CAAAAGGTGCAACATCTATCTATGTGAAAAACCTGACAGrrGTAAAAGATGGAAAAGAGCAAAAAGTAGAAT'?TGATAAAGCTGTACAAGTTC^ 

J 806 IKEAKPATK* 

3001 TATTAAAGAAGCAAAArGTGGAAGAAAATAATTAtA AAATCATTT^A;^AAATGATTAATAAG GGCCTATGAGAAAATAGAC AAAA^TCTATG^ 

3101 G GCTTTTT ATVITGCTCTTATTTCATATACATTTGAAATTGAAAGGAGTAGCCTTATTGAGAAACAAATCAAAAGCGCT 

3201 CATTGATATGTAAATTATTTGTAAATTAACACCTTTTTTTGTTAAAAAAACATTCGAAAAGAAGAATAAAATGTCGAAATGTAATAATTTTAGTC 

3301 TAGTCTGAAAATTGTATTTATTGTAAAAAATATTTAGGTGATGAAGTGGAAATATGTGCTATAATTCCAATa'CTrGTATTCGATA^GAAA.^^ 

3401 CGCAGTGAGAGTGGTTTATAGGAAAATGGTTTGTTCATTTCTATATTTAAGTATTATCGAT 

FIG. 3. DNA and deduced amino add sequence o! the reyion encompassing \Uc.sap gene. In the lelt margin, nnmhers in nmian type indicate Ihe first nucleotide 
ol' the line, and numbers in italic type refer to the amino acid residues, lor which the one-letter code has heen used (ihe asterisk represents the stop codon). The putative 
ribosome-hinding site and promoter sequences as well as the tour short internal repeals, have heen underlined. The arrows indicate the palindromic structures. The 
////idlll and EatRV restriction sites shown in Fig. 2 are indicated. 



quence and the ATG initiation codon that is consistent with 
that observed In other gram-positive bacteria (26). 

Two palindromic structures were observed downstream 
from the TAA stop codon, the first beginning just after the 
TAA (AG = -11.6 kcal) and the second beginning 49 nucle- 
otides farther downstream (AG = -6 kcal) followed by a 
stretch of U's. These two sites may correspond to a factor- 
independent transcription termination signal. 



Translation of the ORF produced an 81 4-ami no-acid pro- 
tein with a calculated molecular mass of 86,567 Da. The se- 
quences of the chymolryplic polypeptides were localized to 
various positions throughout the protein (Fig. 3). A compari- 
son of the N terminus of Ihe mature protein (Table 1) and the 
amino acid sequence cleUucecl trom the nucleotide sequence 
(Fig. 3) indicated that this protein is synthesized as a pre- 
polypeptide with a 29-amino-acid signal peptide. The signal 



Vol. 177, 1995 



S-LAYER OF ISACILLUS ANTHRACIS 617 



371 WAESKEiiKVS 

4 90 AQKAMKEXKLE 

540 DGKELiKEQKLE 

785 VVK£GKEQKVE 

FIG. 4. Alignmcnl of the lour sliorl inlcrnal repeals of the Sap prolctn. 
klentitics (hold eharaelers) and similarilies (1, L, V, M; IX E; underlined char- 
acters) are indicated whenever two repeats share such a leal u re. 



peptide is similar to other gram-positive signal peptides, with a 
charged N terminus (3 lysine residues) followed by a hydro- 
phobic domain, a proline residue, and an alanine-rich region 
encompassing the cleavage site. The presence of the signal 
sequence implies that the mature form of the protein has a 
molecular mass of 83,720 Da. On an SDS-polyacrylamide gel, 
however, Sap migrates with an apparent molecular mass of 94 
kDa, suggesting that it may be glycosylated, as has been pre- 
viously described for other S-layer proteins (21). 

The amino acid composition of the Sap protein was com- 
pared with that of other S-layer proteins. It is noteworthy that 
all these proteins share a number of common features. For 
example, they are devoid of cysteine residues, contain very few 
methionines (0.9% in the mature form of the B. anthracis Sap 
protein), and have similar percentages of nonpolar, acidic, and 
basic residues (21). Like ih& Acetogenium kivui S-layer protein 
(16), the B. anthracis Sap protein is not very acidic, with a 
calculated pi of 6.02. This is in contrast to other S-layer pro- 
teins that tend to have a more acidic pi. 

S-layer proteins often harbor internal repeats, and two types 
were found in Sap. A short sequence was repeated four times 
throughout the second half of the protein (Fig. 3 and 4). It was 
lysine and glutamic acid rich and had a conserved motif (6 of 
11 residues were identical or similar) (Fig. 4). A longer repeat 
was present twice and was localized in the N-terminal part of 
the protein, as has been observed for other S-layer proteins 
(Fig. 5). 

Several proteins sharing similarities with Sap were found by 
using the FASTA program of the Genetics Computer Group 
package to scan the SwissProt database. The proteins with the 
best initial scores were the S-layer protein o(A. kivui (16) with 
15.3% identity and the Ompa protein of Thermotoga maritima 
(4) that exhibited 18.5% identity in a 346-amino-acid overlap. 
The greatest similarity was found between the N-terminal re- 
gions of these proteins (Fig. 5). The Pileup program was ap- 
plied to Sap and five other sequences because of shared sim- 
ilarities, as suggested by the FASTA search, or as indicated by 
other authors. The proteins compared were Sap, the A. kivui 
S-layer protein (16), the MWP of B. brevis (24), the hexago- 



nally packed intermediate layer protein of Dcinococcus radio- 
dura ns (17), the Ompa protein of T. maritima, and the outer 
wall protein of Z?. brevis (25). Pileup indicated that the /I. kivui 
S-layer protein and the B. brevis MWP are the most similar, 
immediately followed by the B. anthracis Sap and the T. mar- 
itima Ompa. Interestingly, while overall similarities were not 
considerable, the N-terminal regions aligned well. The greatest 
similarities were between regions which correspond to internal 
repeats found in the S-layer proteins and overlap the longer 
Sap interna! repeat (Fig. 5). 

Characterization of the sap gene product. To study the role 
of the S-layer structure,*a sap deletion mutant was constructed. 
The //mdlll-EcoRV fragment between nucleotides 1005 and 
1304 (pEAI 207 [Fig. 2]) was removed and replaced by a 
spectinomycin resistance cassette on a conjugative suicide vec- 
tor. Subsequently, the wild-type ^c/p gene was deleted by allelic 
exchange on the chromosome of RPIO which was subsequently 
cured of pXOl. The secreted proteins produced by the result- 
ing 5a/;-deleted strain (RBA2) were compared with those in 
the supernatant of strain 9131. On an SDS-polyacrylamide gel, 
there was an absence of the 94-kDa protein in RBA2 (Fig. 1). 

We examined the correlation between the presence of Sap 
and an S-layer around B, anthracis. The electron micrographs 
of strains 9131 and RBA2 are shown in Fig. 6. The S-layer was 
visible around the 9.131 bacterium, with a typical patterned 
ultrastructure, but was not observed around RBA2. This sup- 
ports the notion that ihc sap gene encodes a protein associated 
with the B. anthracis S-layer. Colonies of RBA2 and 9131 were 
morphologically very different. The RBA2 colonies were much 
larger than the colonies of 9131, and they tended to connect 
when streaked. When the two strains were grown in SPY liquid 
medium, the Asap strain flocculated during growth and sedi- 
mented as soon as shaking was arrested, which is different from 
the behavior of the wild type. Similar morphological changes 
were observed with .other constructed mutants deficient in Sap 
production. Analysis with an optical microscope showed that 
cells from 9131 were classical rod-shaped bacilli (=5 |jim long), 
whereas RBA2 cells were long and filamentous ( = 100 |xm 
long). The presence of these long filaments could explain the 
Hocculation observed in liquid culture and the formation of 
large colonies. It is tempting to correlate all these morpholog- 
ical differences with the variations in Sap protein quantity, i.e., 
with the presence or absence of an S-layer. 

DISCUSSION 

In a first step to study B. anthracis S-layer structure, the gene 
encoding the corresponding protein subunit was isolated on 



Bb I |yiKKVyNSViiASAIiALTVAPMAFAAEEMTTTAPKMDADMEKBiKRLEAl.GLVAGYGNGEYGV 

Ak 1 MKNLKKijIAWSTFAiiVFSAl^VGFA . . ATTPFTDVKDDAPYASAVARLYALNITNGVGDPKFGV 

Tm 1 MKEVL. - .LTVAMLSVFFSAMFA FFPDVPKDHWAY^YyWKLWQRGIFIGYPDGEFKG 

Ba 1 MAKTNSYKKVIAGTMTAAMVAGVySPVAA. . ASKTFPDVPADHWGIDSINYLVEKGAVKGNDKGMFEP 

Ba 170 YKLDTKVNGTGTGD.QiiEP 

Bb 63 DKTXTRAEFATLyVRARGIiEQGAKLAQFSN .lITDVKSHJWEAGFYtWASGEEiyKGFPDKSFKPQNQ 

Ak 64 DQPiiTRAQMITFyNRjaLGYEDLA£;MAK.SEKSAFKDVPQNHWAVGQINLAYKLGLAQGVGNGKFJ2PNSE 

Tm 55 DEYITRYEAATAySRLL^FIEQKMLAG . ASCDLAQV . VGHLSDKXMALEEKVNGLTGILDTL . . . ASQ 

Ba 67 GKELTRAEAAT^mAQILhlLPIDKC AKP^FADS . QGQWXTPFIAAVEKAGVIKGTGNG . FEPNGK 

Sa 188 KKTYTJSAEAAQFIAKTDKQFGTEA AKVESAKA 

FIG. 5. Qimparison ol lhc N-lcrmiiuil rcjjion nf tlic Sap prnlcin wilh ihal ol'lwt) other S-kiycr pntlcins unci tine related prolcin.Th c four seLpienees {H. brevis MWP, 
lib, A. kivui S-layer protein. Ak; 7! maritinm ihnpa protein. 7'm; aiul ti. anttiraas .Sap protein, isa) were uligiial hy iiilUHlinjint; g aps (Uni.s). T h e Sa p prote i n intcrnul 
repeal, starling at re.siLliie 170. was adJed, For Ihc .sake ol elarity, the two 11 unihrucis sequences have heen grouped at the holtoni nl the ligure, whereas the Pileup 
program had inserted the N-lerminal sequence ol Sap between the/1, kivui and the T. /naritima sequences. The signal peptide cleavage sites (arrow) have been aligned. 
Idenlily and similarily (Fig. 4; also. A, G; D, E, N, O; F, W, Y; S. T. K, K) were established in coniparison with the N-lerinnial portion ol the Sap protein. 
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FIG. 6. Electron micrographs oflhc cell walls ot R anihracis cells. Cells were ncgalivcly stained with ammonium molylxlale. (a) Sap^ strain 9131; (b) Sap" mutant 
strain KBA2; (c) ultrathin section of intact cell ol'Sap"^ strain 9131; (lI) ultra thin section ol intact cell ol Sap" mutant strain KliA2. s, S-layer; pg, pcptidoglycan; cm, 
cytoplasmic membrane. Bar, 0.1 ^.m. 



~ B, btem cell wal l p roteins, lack s th e -35 sequenre (i) Thf , 
absence of a consensus sequence in the vicinity of the 
known B. anthracis genes could be due to altogether different 
cr factor recognition sequences. This would be unusual, how- 



ever, since phylogenetically more distant organisms, such as E. 
coli and B. subtUis, share the same consensus sequences. An- 
other explanation is that these B, anihracis genes are not tran- 
scribed by a major cr factor, although less utilized u factor 
recognition sequences were not found. The hypothesis that we 
favor is that these genes are positively regulated and that, as 
with such genes from B. subiilis, an activator compensates for 
the poor recognition sequence. The regulatory region of the B. 
anthracis sap gene awaits further analysis. 

The amino acid sequence of Sap was compared with those of 
other vS-layer proteins. Sequence similarities were found for 
the N-terminal regions of Sap, iwo other S-layer proteins 
(MWP from B. brevis and the S-layer protein from A. kivui), 
and the T. maritima Ompa protein. These sequences contain 
internal repeats that could play an important role in the as- 
sembly of the layer. Interestingly, proteins tmplicated m the 
attachment of Clostridium, thermocellum cellulosome to the 
membrane also have similarities with the N-terminal repeats 



two contiguous DNA fragments. Oligonucleotide probes de- 
rived from established amino acid sequences of the 94-kDa 
major extracellular protein were successfully used to clone the 
corresponding gene (sap). Sequence data and deletion analysis 
support the notion that the S-layer comprises the 94-kDa pro- 
tein. 

Sequences resembling the recognition sequences of Bacillus 
subtilis (T^ factor were looked for in the region upstream from 
this ORF, and none which were really convincing were found. 
This absence is not entirely surprising. Such sequences have 
not been found for B. anthracis promoters where transcription 
is initiated at a high level, for example, upstream from a toxin 
component gene (10). Moreover, the promoter, which is prob- 
ably responsible for the constitutive high-level synthesis of the 
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FIG, (\ — Continued. 



from the S-layer proteins (7). Therefore, these repeat regions 
may be important in the interaction between the peptidogiycan 
wall and the S-layer structure. iVloreover, a highly charged 
sequence was found to be repealed four times in Sap. It could 
have an important structural role. It has been suggested that 
the r. maritima Ompa protein connects the outer membrane 
to the inner ceil body, with its carboxy-terminal hydrophobic 
tail probably being buried in the membrane. Therefore, the B. 
anthracis Sap protein may anchor the capsule to the pepti- 
dogiycan wall, utilizing both its repeated sequences and its 
hydrophobic C-terminal tail. 

The function and the in vivo expression of the B, anthracis 
S-layer remain unknown. In the absence of the S-layer, dra- 
matic morphological changes are observed, under both liquid 
and solid culture conditions. Since synthesizing the S-layer is 
energy-consuming, it would be interesting to determine where 
the pressure to maintain it occurs: in the soil or in the host. A 
major cell-associated protein with a molecular mass of 91 kDa 
(EAi) has been reported to be a predominant antigen follow- 
ing vaccination with the live spore vaccine (5). It should now be 
of interest to determine if EAI corresponds to the Sap protein. 
If the host is where the S-layer is maintained, its role cannot be 
simply to protect against complement killing or to enhance the 
binding of the bacterium to host molecules or macrophages, 
< ; in rr. i n v i vn thn h;intnrin nre pnrMp si il ated . T herefore, al- 
though B. anthracis probably goes through few, if any, life 
cycles in the soil, an eventual protective role against osmotic 
pressure could exist in this environment where the S-layer 



could be the outermost wall structure. In the future, an exten- 
sive study of the synthesis of the S-layer in vivo as well as that 
of an eventual interaction between the S-layer and the capsule 
will have to be undertaken. 
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